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AN  EFFICIENT  ALGORITHM  FOR  A  VISIBILITY-BASED 
SURVEILLANCE-EVASION  GAME 

RYO  TAKEI,  RICHARD  TSAI,  ZHENGYUAN  ZHOU,  AND  YANINA  LANDA 


Abstract.  We  present  an  algorithm  which  computes  the  value  function  and  optimal  paths  for 
a  two-player  static  game,  where  the  goal  of  one  player  is  to  maintain  visibility  of  an  adversarial 
player  for  as  long  as  possible,  and  that  of  the  adversarial  player  is  to  minimize  this  time.  In  a 
static  game  both  players  choose  their  controls  at  initial  time  and  run  in  open-loop  for  t  >  0  until 
the  end-game  condition  is  met.  Compared  to  closed-loop  (feedback  strategy)  games  which  require 
solving  a  PDE  in  high  dimensions,  we  demonstrate  that  the  static  game  can  be  solved  directly 
in  the  state  space  by  the  proposed  PDE-based  technique.  This  results  in  significant  savings  in 
both  memory  and  computational  cost,  at  the  expense  of  a  simpler  information  pattern  that  is 
more  conservative  towards  one  player.  In  addition,  we  describe  how  this  algorithm  can  be  easily 
generalized  to  games  with  multiple  evaders.  Applications  to  target  tracking  and  an  extension  to  a 
feedback  control  game  are  also  presented. 


1.  Introduction 

Consider  the  setting  where  a  mobile  agent  is  required  to  visually  track  a  mobile  target  in  an 
environment  containing  obstacles  that  block  the  agent’s  line  of  sight.  For  the  mobile  agent  to 
maximize  the  time  interval  in  which  it  maintains  visibility  of  the  target,  a  conservative,  worst-case, 
strategy  is  to  assume  that  the  target  actively  seeks  occlusion  behind  the  obstacles.  Hence,  the 
problem  may  be  posed  as  a  pursuit-evasion-type  game:  the  game  terminates  the  first  instance  in 
time  the  pursuer  loses  sight  of  the  evader,  the  pursuer’s  objective  is  to  prolong  the  game  as  much 
as  possible,  and  the  evader’s  objective  is  to  terminate  the  game  as  quickly  as  possible.  We  shall  call 
this  type  of  game  a  visibility-based  surveillance- evasion  game ,  or  simply  a  surveillance-evasion  game. 

In  this  work,  we  investigate  optimal  motion  strategies  for  two  or  more  players  moving  with  inho¬ 
mogeneous  and/or  anisotropic  velocities  in  a  domain  containing  obstacles.  The  main  contribution 
is  a  computationally  and  memory  efficient  algorithm  for  constructing  the  players’  optimal  paths 
for  the  static  surveillance-evasion  game.  In  this  game  the  pursuer  initially  chooses  its  control  for 
the  remaining  time,  assuming  that  the  evader  would  always  counter  with  its  best  control.  As  we 
shall  see,  the  particular  information  pattern  of  the  static  game  allows  for  a  decomposition  of  the 
problem  into  simpler  optimal  control  problems  in  the  underlying  domain,  which  can  be  implemented 
in  parallel.  Our  approach  contrasts  non-anticipative  or  feedback  strategy  games,  which  require  so¬ 
lutions  to  Hamilton- Jacobi-Issacs  partial  differential  equations  in  the  joint  configuration  space  with 
a  computational  complexity  which  increases  exponentially  with  the  number  of  participating  players. 

The  efficiency  of  the  algorithm  allows  for  various  applications  and  extensions.  For  example,  the 
algorithm  generalizes  easily  to  the  game  involving  multiple  evaders  while  maintaining  a  complexity 
which  is  linear  in  the  number  of  players.  Furthermore,  our  algorithm  may  be  applied  iteratively  in 
small  time  increments  to  provide  efficient  solutions  to  a  class  of  dynamic  target  tracking  problems. 

1.1.  Previous  work. 

1.1.1.  Related  problems  involving  surveillance  and  visibility.  Surveillance-evasion  games  were  studied 
geometrically  in  [L079,  LB75,  Gre87],  where  the  game  terminates  the  instance  the  evader  exits  the 
pursuer’s  local  “detection  circle.”  It  is  unclear,  however,  how  these  geometric  formulations  could  be 
applied  in  the  context  of  the  more  general  visibility-based  surveillance-evasion  game.  Furthermore, 
the  geometric  nature  of  the  derivations  makes  generalizations  to  arbitrary  players  dynamics  and  the 
presence  of  state  constraints  difficult  to  analyze. 
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In  a  domain  with  polygonal  obstacles,  the  numerical  methods  for  the  visibility  based  surveillance- 
evasion  game  with  constant,  isotropic  velocities  were  studied  in  [BH08a].  There,  the  authors  geo¬ 
metrically  characterized  sets  called  “decidable  regions”  consisting  of  all  possible  pursuer  initial  states 
where  the  outcome  of  the  game  is  known,  given  the  initial  state  of  the  evader.  The  same  authors,  in 
[BH08b],  also  considered  computing  optimal  trajectories  of  the  surveillance-evasion  game  backwards 
in  time  by  tracing  particular  characteristic  curves  from  the  terminal  condition  set. 

We  also  mention  another  class  of  problems,  called  visibility  based  pursuit-evasion,  which  concerns 
the  motion  planning  of  an  agent  that  seeks  out  an  initially  hidden  mobile  target  in  a  simply  connected 
polygonal  domain  [SY92,  TL08,  GLL+97].  The  simplified  representation  of  the  environment  using 
polygons  is  a  major  limitation  of  the  algorithms  based  on  computational  geometry  and  combinatorics. 
Furthermore,  the  extension  of  these  algorithms  to  three  dimensional  problems  may  be  extremely 
complicated.  In  [LT08],  the  authors  present  a  robust  algorithm  for  reconstructing  the  visible  portions 
of  the  obstacles’  boundaries  as  well  as  identifying  visible  regions  of  a  bounded  domain  provided 
a  vantage  point  and  a  dense  collection  of  points  sampled  from  the  surfaces  of  solid  obstacles  in 
the  domain.  The  formulation  may  be  applied  to  arbitrary  bounded  regions  independent  of  their 
geometry.  The  resulting  visibility  function  can  easily  be  transformed  into  the  level-set-based  visibility 
function  [CT05,  TCO+04],  which  is  described  below  and  is  used  as  a  basis  of  the  algorithms  described 
in  this  paper. 

1.1.2.  Pursuit- evasion  type  game.  One  of  the  initial  applications  of  the  pursuit-evasion  games  was 
missile  guidance  systems  formulated  by  R.  Isaacs  in  the  celebrated  monograph  [Isa65].  Other  ap¬ 
plications  of  this  problem  include  security  and  surveillance  systems  as  well  as  search/rescue  efforts. 
The  proposed  strategies  could  be  used  by  autonomous  systems  or  by  human  searchers. 

In  Section  2.2,  we  outline  how  the  connection  between  differential  games  and  Hamilton- Jacobi 
equations  is  established.  The  key  idea  is  to  introduce  a  value  function  of  the  game,  which  formally 
satisfies  a  particular  PDE,  known  as  Hamilton- Jacobi-Isaacs  (HJI)  equation.  The  characteristic 
curves  of  the  HJI  equation  correspond  to  the  optimal  paths  of  the  players.  The  first  treatment 
of  such  equations  appeared  in  the  monologue  [Isa65].  Subsequently,  with  the  advent  of  the  notion 
of  viscosity  solutions  [CL83],  rigorous  theories  regarding  the  existence  and  uniqueness  of  solutions 
(the  value  functions)  to  HJI  equations  were  established;  see  [ES84]  and  the  references  therein. 
Numerical  results  have  been  developed  to  compute  the  value  and  optimal  paths  for  various  games, 
see  e.g.  [BFS99].  In  practice,  the  high  dimensionality  of  the  HJI  equation  makes  this  approach 
tractable  only  on  a  modestly  sized  grid  with  a  small  number  of  players  -  usually  two  players  in  at 
most  a  two  dimensional  domain. 

1.1.3.  Target  tracking.  A  related  target  tracking  problem  consists  of  a  combination  of  different  chal¬ 
lenging  problems  that  have  been  separately  studied  in  robotics.  Visibility  and  collision  constraints 
must  be  satisfied  in  the  presence  of  uncertainties  in  the  positions  of  the  observer  and  the  target 
[LGBL97,  FGBLL01,  MCGT02,  BLJH05,  YL08].  The  observer  would  like  to  maintain  visibility 
of  the  target  while  avoiding  collisions  with  the  obstacles  [LGBL97].  When  the  target  is  fully  pre¬ 
dictable,  that  is,  its  trajectory  is  known,  a  dynamic  programming  approach  [Ber86]  can  be  used  to 
compute  the  shortest  path  for  the  observer.  If  the  target  is  only  partially  predictable,  this  approach 
fails.  Instead,  the  observer  chooses  the  motion  command  which  would  maximize  the  likelihood  that 
the  target  will  remain  visible  during  some  time  in  the  future  [LGBL97,  FGBLL01,  MCGT02].  In 
[LGBL02] ,  a  more  complicated  problem  of  tracking  unpredictable  targets  in  unknown  environments 
is  considered.  The  observer  is  able  to  reconstruct  a  local  map  of  the  environment  based  on  range 
data.  Then,  a  new  velocity  is  determined  using  a  combinatorial  algorithm  to  minimize  the  risk  of 
target  escaping  the  observer’s  view  by  crossing  an  occlusion  ray  created  by  the  obstacle.  The  gen¬ 
eral  problem  of  inferring  where  the  unpredictable  moving  targets  could  be  as  they  pass  out  of  view 
is  presented  in  [YL08].  The  main  idea  behind  [YL08]  is  to  introduce  information  spaces  that  ex¬ 
tract  and  maintain  combinatorial  data  obtained  by  robots  that  carry  sensors.  The  proposed  strategy 
[YL08]  can  be  easily  generalized  to  two  or  three  dimensional,  known  or  unknown,  multiply  connected 
domains,  as  long  as  the  connected  components  of  the  shadow  region  are  maintained. 

The  problem  of  target  tracking  in  an  arbitrary  domain  using  a  variational  model  was  proposed 
in  [CT05].  There,  the  pursuer’s  control  was  determined  by  optimizing  a  functional  composed  of  a 
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linear  combination  of  the  distance  between  the  two  players  and  the  visibility  area.  Gradient  descent 
was  employed  to  lead  the  pursuer  to  a  local  minimizer  of  the  functional. 

1.1.4.  Optimal  control  and  visibility  in  the  Hamilton- Jacobi  framework.  Our  approach  to  the  vis¬ 
ibility  based  surveillance-evasion  game  is  based  on  decomposing  the  problem  into  optimal  con¬ 
trol  problems.  Thus,  its  efficiency  hinges  on  that  of  the  solution  for  the  corresponding  optimal 
control  problems.  To  this  end,  we  exploit  recent  developments  in  numerical  Hamilton- Jacobi- 
Bellman  equation  solvers  to  determine  the  value  function  associated  with  the  optimal  control  problem 
[TCOZ03,  Set95,  BCD97a,  RT92,  Tsi95,  SV03,  GR85,  FF02] ;  optimal  controls  and  paths  can  then 
be  extracted  from  the  value  function  by  means  of  solving  an  ordinary  differential  equation  along  the 
characteristic  curves,  see  appendix  in  [BCD97a]. 

Another  useful  tool,  the  level-set  based  continuous  visibility  function  <p(-,  •)  [CT05,  TCO+04]  has 
the  property  that  for  any  two  points  x,^o, 

p(x,  xq)  <  0  <G>  x  is  occluded  from  xo, 

when  obstacles  are  suitably  defined  implicitly  as  a  level  set  function;  i.e.  p  is  zero  on  the  obstacle’s 
boundary,  is  negative  inside  and  positive  outside.  This  formulation  allows  for  accurate  description 
with  subcell  resolution  of  the  the  visibility  information  that  is  convenient  for  Boolean  operations  of 
the  described  regions  as  well  as  for  integration  and  differentiation.  Numerically,  p  can  be  computed 
in  O(N)  complexity  on  a  grid  with  N  nodes,  by  solving  a  Hamilton- Jacobi  equation  with  special 
sweeping  techniques  [TCO+04].  In  the  appendix,  we  extend  the  notion  of  a  visibility  function  to 
that  of  a  shadow  function: 

£(x,  xo)  <  0  <G>  x  is  occluded  from  xq  and  x  is  not  inside  an  obstacle. 

1.2.  Outline  of  article.  The  article  will  proceed  as  follows.  In  Section  2,  we  lay  out  all  necessary 
assumptions  as  well  as  give  a  precise  definition  of  a  static  information  pattern  for  a  two-player 
game.  Next,  in  Section  3,  we  present  the  theory  and  algorithm  for  the  static  surveillance-evasion 
game,  as  well  as  numerical  results  demonstrating  its  performance.  Applications  and  extensions  of  our 
algorithm  to  multiple  evaders,  target  tracking  and  feedback  control  games  are  presented  in  Section  4. 


2.  Information  patterns  and  game  definitions 


2.1.  Preliminary  definitions.  We  partition  an  open  and  bounded  ambient  space  Cl  into  the 
freespace  and  obstacles :  =  f2free  U  f20bs?  where  fJfree  is  open;  S20bs  serves  as  impenetrable  ob¬ 

stacles  to  all  players  and  blocks  the  line  of  sight  between  any  two  players.  For  a  vantage  point 
x  G  flfree  >  we  denote  Vx  C  fJfree  to  be  the  relatively  closed1  subset  in  fifree  of  all  points  that  are  not 
visible  from  x. 

We  shall  refer  to  the  two  competing  players  as  E  (for  ‘Evader’)  and  P  (for  ‘Pursuer’),  with  fixed 
initial  states  G  fJfree?  respectively.  Let  A  =  {&  |  ||d||2  =  lor(j  =  0}c  Mn  be  the  compact 

set  of  control  values  and  A  =  {measurable  a:  [0,  oc)  — »  A}  be  the  set  of  time-parametrized  controls. 
Let  xe  :  [0,  oc)  -A  Qfree  and  xp\  [0,  oc)  -A  fJfree  represent  the  time-evolution  of  E  and  P’s  states, 
respectively.  We  assume  the  players  move  according  to  the  following  geometric  dynamics: 


xE(t)  =  fE(xE(t),aE(t))aE(t)  folt>0  Le(0)  =  x°E 

xp(t)  =  fp(xp(t),ap(t))ap(t)  °  ’  pP(0)=aA 


where  fp  :  fhree  x  A  -A  M,  fp :  £2free  xJg!  are  speeds  assumed  to  satisfy  the  following: 

(Al)  /#,  fp  are  Lipschitz  continuous. 

(A2)  There  exist  constants  F\,  >  0  such  that  F\  <  /#(#,  <r),  fp(x,  a)  <  for  all  x  G  f2free 

and  a  G  A. 

(A3)  For  every  xp,xp  G  fJfree  the  interior  of  the  vectograms  {/e(^e,  cfe)^e  \  &e  £  7L\{0}}, 
{fp{xp1crp)crp  |  bp  G  A\{0}}  are  convex. 


Two  players  are  considered  occluded  if  they  lie  on  each  other’s  visibility  horizon ,  i.e.  the  line  segment  connecting 
the  players  that  is  tangent  to  an  obstacle  boundary. 
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Assumption  (Al)  implies  a  unique  solution  to  (1)  for  a  pair  of  controls  Gp  £  A,  ap  £  A  up  to 
some  finite  time;  we  refer  to  these  solutions  xp{-),xp{-)  as  paths  for  E  and  P,  respectively.  To  avoid 
cumbersome  notation,  we  will  not  write  the  explicit  dependence  of  xp(m)  on  ge  and  x°E,  and  similarly 
for  xp(-)  on  Gp  and  xP.  Assumption  (A2)  ensures  that  the  minimum  time  value  functions 

(2)  ue{x )  =  inf  inf{£  >  0  |  XE(t)  =  t}, 

cr^eA 

(3)  up {x)  =  inf  inf{£  >  0  |  xp{t)  =  x }, 

<jPeA 

for  the  single  player  optimal  control  problems  are  continuous.  Finally,  the  convexity  assumptions 
of  the  interior  of  the  vectograms  in  (A3)  guarantee  the  existence  of  optimal  controls  in  (2)  and 

(3)  and  closedness  of  optimal  paths  (under  uniform  convergence  over  bounded-time  intervals)  in 
an  unrestricted  state  space;  for  a  remark  on  the  existence  of  optimal  controls  for  a  restricted  state 
space,  see  Remark  2.2.  Note  that  the  zero  control  value  0  £  A  gives  the  players  the  option  to  stop. 
As  we  shall  see  later,  the  existence  of  the  zero  control  crucial  is  when  characterizing  the  optimal 
control  for  P. 

Remark  2.1.  It  can  be  shown  by  a  formal  derivation  that  up  satisfies  the  Hamilton- Jacobi  (HJ) 
partial  differential  equation  (PDE) 

(4)  max{VR£(x)  •  fp{x,  ct)g}  =  1, 

<j£A 

in  fJfree\{^£;}  and  similarly  for  up.  It  is  known  that  in  general,  even  under  the  assumptions  (Al)- 
(A3),  Lipschitz  continuous  solutions  of  (4)  with  typical  Dirichlet  boundary  conditions  are  non¬ 
unique.  However,  it  is  possible  to  characterize  a  generalized  solution,  called  a  viscosity  solution 
[CL83].  It  has  been  shown  that  the  viscosity  solution  to  (4)  coincides  with  the  value  function  up 
of  the  corresponding  optimal  control  problem  [Lio82].  The  advantage  of  characterizing  the  value 
function  as  a  HJ  PDE  is  the  availability  of  a  plethora  of  efficient  numerical  methods  known  to 
converge  to  the  viscosity  solution,  including  fast  sweeping- type  [TCOZ03,  RT92],  fast  marching- type 
[Set95,  Tsi95,  SV03]  and  semi-Lagrangian  methods  [GR85,  FF02].  See  also  [Mit08]  for  a  publicly 
available  Mat  lab  toolbox  for  computing  solutions  to  a  wide  class  of  HJ  equations. 

Remark  2.2.  In  a  restricted  state  space  (i.e.  f4free  7^  Mn),  the  optimal  controls  in  (2)  and  (3)  may  not 
exist,  even  under  the  assumptions  (A1)-(A3).  However,  from  the  definition  of  the  value  function, 
there  will  always  exist  controls  that  are  e-suboptimal ,  provided  the  value  function  is  finite.  For 
instance,  given  e  >  0,  there  exists  cr*  £  A  such  that  the  corresponding  path  x*E{-)  for  E  satisfies 

uE(x)  >  inf{£  >  0  |  Xp(t)  =  x}  —  e. 

To  simplify  the  presentation,  we  will  loosely  refer  to  an  “optimal  path”  as  a  path  that  corresponds 
to  an  optimal  control,  if  such  a  control  exists,  or  an  e-suboptimal  control,  if  no  optimal  control 
exists.  For  detailed  discussions  on  numerical  computations  of  optimal  trajectories  and  empirical 
demonstrations  of  its  computation  in  a  restricted  state  space,  we  refer  the  reader  to  Appendix  A  in 
[BCD97a]. 

2.2.  Feedback  strategies  and  the  value  function.  E’s  objective  is  to  choose  an  optimal  ge  £  A 
in  order  to  minimize  the  payoff,  while  P’s  objective  is  to  choose  an  optimal  gp  £  A  to  maximize 
the  payoff.  For  instance,  P’s  strategy  may  be  to  move  in  the  direction  that  locally  increases  the 
visibility  area  from  the  vantage  point  xp(t)  [CT05]. 

A  feedback  strategy  at  a  time  t  >  0  is  a  response  rule  of  a  player  based  on  the  system  states  up 
to  t.  For  notational  simplicity,  write  x(-)  :=  (xp{-),xp(-))  as  the  ordered  pair  of  E’s  and  P’s  paths. 
Let  V  =  (x(-) :  [0,oo)  ->•  fi|ee  |  x(0)  =  (r^,Xp)}  denote  the  set  of  paths  for  E  and  P  from  their 
respective  initial  states.  Then,  a  feedback  strategy  for  P  is  a  map  £ :  V  -A  A  that  satisfies 

1.  x(s)  =  5t(s)  for  all  8  £  [0,  t]  implies  £[x](s)  =  C[XK5)  f°r  all  8  £  [0 ,£], 

2.  for  all  Gp  £  A  and  T  >  0,  there  exists  a  unique  solution  to  (1)  with  ap{t)  =  C[x](£)  f°r 

t  £  (0, T\. 

Let  Tp  be  the  set  of  all  feedback  strategies  of  P,  and  let  (xe(-),  xp(-;  ge,  ())  fhe  solution  to  (1) 

corresponding  to  Gp  £  A  and  Gp  =  ((xp)  where  (  £  Tp.  We  then  define  the  payoff  function 

Jteedback(x°E,Xp,(TE,C)  =  inf{^  >  0  |  XE(t)  G  Acp(t;<7E,<)}- 
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In  a  feedback  strategy  game ,  at  each  time  t  >  0,  one  player  chooses  its  control  at  t  knowing  the 
opponent’s  state  from  times  0  up  to  and  including  t.  For  the  moment,  we  assume  the  player  choosing 
the  control  is  P. 

Definition  2.3.  The  (lower)  value  function  u :  fij?ree  -A  M  for  the  feedback  strategy  game  is 

(5)  u(x°E,X°p):=  sup  inf  ^feedback (xE,X°p,  <JE,  <)• 

C eTP °EeA 

Here,  for  any  of  P’s  feedback  strategy,  E  chooses  an  optimizing  control,  and  P  then  picks  out 
one  strategy  that  has  the  best  payoff.  It  can  be  shown  that  under  certain  conditions  the  value 
for  such  games  is  equal  regardless  of  which  player,  E  or  P,  chooses  the  control  and  which  reacts 
by  choosing  feedbacks.  Furthermore,  u  is  a  viscosity  solution  to  the  Hamilton- Jacobi- Is  sacs  (HJI) 
equation  [BCD97b,  CL83,  ES84] 

(6)  l=m&xmm{-fE(xE,aE)aE-\?XEu-fp(xp,aP)ap-\7Xpu}  on  g?ree\T, 

aPeA  crE£A 

where  u  =  0  in  T  =  {(x,y)  G  ^free  |  x  G  Vy}  and  Wxu  is  the  gradient  of  u  with  respect  to  the 
variable  x  G  f2free- 

In  contrast  to  the  feedback  strategy  game,  a  game  where  the  feedback  strategy  is  invoked  only 
at  t  =  0  is  called  a  static  game.  That  is,  at  the  start  of  the  game,  one  player  chooses  its  control  for 
all  t  >  0  by  countering  the  control  chosen  by  the  other  player  for  all  t  >  0,  and  the  other  player 
optimizes  knowing  this  response.  Indeed,  in  such  games,  the  optimal  occlusion  time  may  differ 
depending  on  which  player  (E  or  P)  plays  in  response  to  the  other.  In  this  article,  we  focus  on  the 
case  where  P  optimizes  assuming  that  E  will  always  optimally  counter  P’s  strategies.  For  initial 
states  x®E,  Xp  and  controls  ge ,  crp,  consider  the  payoff  function 

(7)  J(x°E,x°P,aE,ap )  =  inf{i  >  0  |  xE(t)  £  VXp (t)}. 

Here  and  throughout  this  article,  we  adopt  the  convention  that  the  infimum  of  the  empty  set  is 
infinity. 

Definition  2.4.  Define  the  (lower)  value  function  vs:  Q|.ee  -A  M  for  the  static  game  as 

(8)  vs(xE,x°p)  :=  sup  inf  J(x0E,x0P,aE,ap). 

apeAaEeA 

In  general,  it  can  be  shown  that  v s  <  u  [BCE+97].  Our  central  result,  presented  in  section  3,  is  an 
efficient  algorithm  which  computes  vs(x(E^x(p)  and  the  paths  corresponding  to  the  optimal  controls 
(or  e-suboptimal  controls,  see  Remark  2.2),  formally  written  as 

(9)  crpGargsup  inf  J(xE,  x?>,  aE,  gp), 

(7  PeAaE^A 

(10)  o*E  G  arg  inf  J (x°E ,  x°P ,  a E ,  Gp) , 

cje^A 

provided  the  game  ends  in  finite  time.  For  brevity,  we  shall  refer  to  the  static  visibility-based 
surveillance-evasion  game  simply  as  the  static  game  and  the  corresponding  value  (8)  as  the  static 
value. 

Remark  2.5.  Note  that  the  value  function  can  be  analogously  defined  for  a  “dual”  static  game  where 
E  optimizes  by  assuming  that  P  will  always  optimally  counter,  by  swapping  the  uinfcrse^”  and 
usupcrpG^”  in  (8);  this  will  yield  a(n)  (upper)  value  function  which  will  be  greater  than  or  equal  to 
vs  and  u  [BCE+97]. 

In  the  differential  game  theory  literature,  a  pursuit-evasion  type  game  typically  refers  to  (6),  i.e. 
the  boundary  value  PDE  problem  whose  solution  corresponds  the  feedback  strategy  value  function. 
We  emphasize  that  the  objective  of  this  article  is  not  to  solve  r,  but  to  solve  the  static  game  value 
function  vs  (and  the  associated  optimal  paths),  which  is  not  known  to  be  a  solution  to  a  PDE. 
While  this  demands  techniques  that  are  not  standard  in  the  numerical  PDE  literature,  it  avoids 
computational  issues  associated  with  solving  (6).  See  Remark  2.6  below. 
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Remark  2.6.  The  HJI  equation  (6)  immediately  suggests  a  numerical  algorithm  to  solve  the  feedback 
strategy  value  function.  However,  since  (6)  is  solved  on  the  joint  configuration  space  f2free,  even  the 
simplest  (non-trivial)  setting  of  two  players  in  a  two  dimensional  ambient  space  leads  to  a  problem 
in  at  least  four  dimensions.  For  problems  with  additional  players,  the  dimension  of  the  problem 
scales  exponentially  in  the  number  of  players.  The  high-dimensionality  of  the  problem  is  attributed 
to  the  information  contained  in  a  single  value  function:  the  optimal  control  can  be  extracted  from 
u  for  every  pair  of  initial  starting  positions  on  the  domain.  However,  for  a  problem  of  finding 
optimal  controls  for  a  single  prescribed  initial  pair  of  positions  (x^,  Xp),  solving  (6)  involves  excessive 
amounts  of  computational  time  and  data  memory. 

We  now  give  a  condition  such  that  the  static  value  function  (8)  is  finite. 

Proposition  2.7.  Assume  that  Ttfree  is  n°t  a  star  domain,  and  sup xeQfreeUE{x)  <  oo.  Then 
vs(x°E,x°p)  <  oo. 

Proof.  For  e  >  0  small,  let  tmax  =  sup^^  up{x )  +  e.  By  the  existence  of  e-suboptimal  controls, 

for  any  y  G  f4free,  there  is  a  path  xe(-)  such  that  xe( 0)  =  x%  and  xp(t max)  =  V- 

Choose  any  aP  G  A,  and  let  xP(-)  be  the  corresponding  path  for  P.  Since  the  domain  is  not 
star  shaped,  there  exists  y  G  f4free  such  that  xP(tma/)  G  Vy.  If  in  response,  E  chooses  aE  G  A 
corresponding  to  a  path  xE(-)  such  that  xE(tmax)  =  y  (which  exists  by  the  earlier  argument),  then 
J(xE,  xP,crE,crp)  <  tmax.  Since  E  has  a  response  control  such  that  J  <  tmax  for  any  control  choice 
of  P,  we  conclude  that  vs(xE,  xP)  <  tmax  <  oo.  □ 

This  proposition  says  that  the  static  game  will  terminate  in  finite  time,  for  example,  if  E  travels 
with  positive  speed  on  a  bounded,  path  connected  f2free  that  is  not  a  star  domain.  This  rather 
strong  statement  results  from  the  fact  the  entire  future  control  values  crp{t)  and  a  pit)  for  both 
players  are  chosen  only  once  at  initial  time ,  and  no  future  revision  is  allowed.  Looking  ahead,  the 
last  proposition  is  useful  in  justifying  the  use  of  our  algorithm  in  the  numerical  examples  discussed 
in  Section  3.  In  Section  4.2,  we  present  a  scenario  in  which  optimal  controls  for  both  players  are 
allowed  to  be  revised  at  discrete  time  increments  as  the  game  progresses. 

We  close  this  section  by  proving  a  simple  observation,  to  be  used  later. 

Proposition  2.8.  For  a  stationary  P,  i.e.  xp(t )  =  £p,Vt  >  0,  the  optimal  control  for  E  for  the 
static  game  is  the  time  optimal  control  from  x°E  to  the  set  Vxop . 

Proof.  Fix  P’s  control  aP  to  have  zero  control  value  0  G  A  for  allt  >  0.  Then, 

vs(x0E,x°p)=  inf  J(x°E,Xp,  crE,  <7 p) 

aEeA 

=  inf  inf{£  >  0  |  xp{t)  G  Vxo  }, 

crEeA  p 

which  precisely  characterizes  the  optimal  control  of  E  as  desired.  □ 

2.3.  Notations  and  game  assumptions.  Since  we  will  be  considering  “optimal  paths/controls” 
in  the  context  of  both  the  differential  game  and  the  related  optimal  control  problems,  we  choose 
to  call  optimal  paths /controls  (or  e-suboptimal  paths/controls)  as  those  corresponding  to  the  static 
game,  and  time  optimal  paths/controls  (or  e-time  suboptimal  paths/controls)  as  those  corresponding 
to  minimum  arrival  time  optimal  control  problems,  such  as  in  (2)  and  (3). 

Throughout  this  article,  we  use  the  following  notations: 

•  Cl  C  Mn  is  an  open  and  bounded  ambient  space;  we  focus  particularly  with  n  =  2,  but  the 
theory  applies  naturally  to  higher  dimensions. 

•  flfree  U  f40bs  =  ^  partitions  the  ambient  space  into  the  freespace  and  obstacles.  We  assume 
that  flfree  is  open. 

•  E  and  P  are  the  two  competing  players,  representing  the  evader  and  the  pursuer,  respectively. 

•  x,y  are  generic  points  in  Q. 

•  Vx  C  £2free  is  the  set  of  occluded  points  from  the  vantage  point  x  G  fifree. 

•  cr#,  crp :  t  G  [0,  oo)  — >  A  are  the  controls  of  E  and  P  at  time  t  >  0,  respectively. 
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•  xe,%p'-  t  G  [0,  oc)  -A  £lfree  are  E  and  P’s  respective  positions  at  time  t  >  0;  we  suppress 
their  dependence  on  the  initial  states  x®E,  x®P  and  controls  ge ,  crp. 

•  x°E,x°p  G  ^free  are  the  fixed  initial  positions  of  E  and  P,  respectively. 

•  xE,Xp  G  fifree  are  the  final  positions  of  E  and  P,  respectively,  to  be  computed. 

•  vs  :  fifree  -A  M+  U  {0}  is  the  value  of  the  static  game,  see  (8). 

Furthermore,  we  shall  make  the  following  assumptions  on  the  games  introduced  henceforth: 

(Bl)  xE  0  Vxo  ,  i.e.  the  players  are  initially  visible  to  each  other. 

(B2)  both  players  are  aware  of  their  opponents’  dynamics, 

(B3)  both  players  know  their  opponents’  positions,  if  and  only  if  they  are  visible  to  each  other. 

Remark  2.9.  Given  a  vantage  point  x  G  f2free,  a  level  set  function  £(•;#):  Q,  -A  M  representing 
Vx  C  $1,  where 

ff(2/;z)  <0  if  y  G  Vx 
\^{y;x)  >0  if  ye  Q\DX 

can  be  computed  in  O(N)  time  on  a  grid  with  N  nodes,  using  the  level  set  visibility  function 
[TCO+04].  The  details  are  described  in  Appendix  A. 

3.  An  algorithm  for  the  static  value  function 

In  this  section,  we  present  an  algorithm  which  computes  the  value  function  and  optimal  paths 
corresponding  to  the  optimal  controls  (9)  and  (10).  Our  algorithm  depends  crucially  on  a  maximal 
set  R*  C  f2free  and  two  functions  t*(--,xE)  and  wr*  (•;  x°P)  which,  in  some  sense,  decouple  the  analysis 
of  the  dynamics  of  E  and  P. 

Definition  3.1.  Given  x  G  f2free, 

(11)  t*(x;x°E):=  inf  ini{t  >  0  \  xE(t)  e  Vx,xE(0)  =  x°E}. 

aEeA 

That  is,  t*(x;xE)  is  the  shortest  time  for  E,  starting  from  xE,  to  become  occluded  from  a  sta¬ 
tionary  vantage  point  x  G  fifree.  Note  that  the  two  players  must  be  initially  visible  to  each  other  by 
assumption  (Bl),  so  t*(x^,]x^)  >  0. 

Definition  3.2.  For  any  subset  R  C  f2free  containing  Xp,  let 

(A r)  wr(x;x°p )  =  inf  inf{£  >  0  |  xp(0)  =  x°P,xp(t)  =  x,xp(s)  G  R,  Vs  G  [0,t]}. 

crPeA 

That  is,  wr(x ;  x°P)  is  the  minimum  time  for  P  to  reach  x  from  x°P  by  traveling  along  a  path  that 
is  contained  in  R.  By  the  small-time  controllability  assumption  (A2),  the  function  wr(-;  x°p)  is  finite 
and  continuous  in  any  pathwise  connected  subset  of  R  containing  x°p.  Next,  we  relate  t*  and  wr  by 
an  inequality  over  a  maximal  set  R*: 

Definition  3.3.  Let  R*  C  f2free  be  the  maximal  set  containing  Xp  such  that 

(12)  wR*{y;x°p )  <  t*(y;x°E),  for  all  y  e  R* . 

In  other  words,  for  any  RD  R*  and  y  G  R\R*,  we  have  WR(y ;  x°P)  >  t*(y;  x°E). 

We  remark  that  the  maximality  of  R*  implies  its  uniqueness.  The  existence  follows  from  the  fact 
that  property  (12)  holds  trivially  for  the  subset  {x°P}  C  ^free- 

Roughly  speaking,  The  functions  t*  and  wr*  defined  above  quantify  the  reachability  of  the  two 
players  by  time  under  optimal  choice  of  controls;  this  allows  us  to  characterize  P’s  possible  controls 
(or  paths)  that  maintain  visibility  of  E  under  worst  case  scenarios.  See  Figure  1. 

3.1.  r-admissible  pursuit  paths.  By  Proposition  2.8,  assuming  a  stationary  P,  an  optimal  strat¬ 
egy  for  E  is  to  move  towards  the  occlusion  set  Vxop  via  a  time-optimal  path,  provided  it  exists.  In 
this  case,  it  will  take  t*(xp;xp)  time  for  an  optimally  moving  E  to  become  occluded  from  P.  This 
also  implies  that  xe(t)  0  for  0  <  r  <  ^(x^x^)  and  ge  G  A.  To  consider  the  case  of  a 
non- stationary  P,  we  introduce  the  following  notion: 
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Figure  1.  An  illustration  of  the  relations  between  t*(y;  xE )  and  wr *  (y;  x°P).  Both 
players  are  assumed  to  have  unit  speeds.  The  shaded  circle  represents  the  obstacle, 
d\  is  the  distance  between  x®E  and  the  obstacle,  and  6  is  some  small  positive  constant. 
The  precise  shape  of  the  curve  that  corresponds  to  wr*  =  t*  is  not  drawn  accurately 
in  this  illustration.  In  this  scenario,  the  game  will  last  until  t  >  d\  since  E  will  not 
be  able  to  reach  the  shadow  boundary  before  P  pushes  the  shadow  boundary  further 
away. 


Definition  3.4  (r-admissible  pursuit  path).  For  a  r  >  0,  we  say  that  a  path  xp(-)  of  P  is  a 
r-admissible  pursuit  path  if 

(13)  t  <  t*(xp(t);  x°E)  for  allt  G  [0,  r). 

Set  T  :=  vs(x(E,x(P),  where  vs  is  defined  in  (8).  Then,  an  optimal  path  xp(-)  for  P  is  a  In¬ 
admissible  pursuit  path.  If  not,  s  >  t*(xp(s);xE)  at  some  s  <  T;  this  implies  that  there  exists  a 
control  (Je  E  A  such  that  xp(s)  E  VXp (s),  thereby  prematurely  terminating  the  game  no  later  than 
s.  Based  on  this  observation,  P’s  optimal  path  is  a  r-admissible  pursuit  path  with  the  largest  r;  this 
corresponds  to  reaching  a  final  point  xp  which  maximizes  t*(--,xE).  As  we  shall  show  later  (Lemma 
3.6),  the  set  R*  coincides  with  the  set  of  points  reachable  by  r-admissible  pursuit  paths.  Therefore, 
the  static  value  can  be  related  to  R*  and  t*(-]x%)  as  follows: 

Theorem  3.5.  vs(x(E,x(p)  =  supxeR*  t*{x\x®E),  provided  both  sides  are  finite. 

We  note  that  it  is  possible  for  t*(--,xE)  to  be  infinite,  for  example,  if  Qfree  is  star  shaped  (see 
Proposition  2.7).  While  the  equality  in  Theorem  3.5  may  still  be  valid  in  such  cases,  we  shall  not 
pursue  them  further.  We  give  the  proof  of  Theorem  3.5  at  the  end  of  this  subsection.  First,  as 
claimed  earlier,  we  show  that  all  points  in  R*  are  precisely  the  points  “reachable”  by  r-admissible 
pursuit  paths  that  start  from  Xp. 


Lemma  3.6. 
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1.  For  any  arbitrary  point  y  £  R* ,  there  exists  a  r -admissible  pursuit  path  xp(-)  such  that 
xp(t )  =  y  for  some  t  £  [0,  r),  if  there  exists  a  time-optimal  path  in  R*  from  Xp  to  y. 

2.  For  any  y  £  flfree\R* ,  there  does  not  exist  a  r -admissible  pursuit  path  xp(-)  such  that 
xp(t )  =  y  for  some  t  £  [0 ,  r). 

Proof.  1.  We  construct  a  r- admissible  pursuit  path  from  a  time  optimal  path  for  the  minimum 
arrival  time  problem  corresponding  to  (A#*).  Choose  a  point  y  £  R* ,  and  define  T, r  >  0  as 

(14)  T  :=wR.(y;x°P)  <t*(y;x°E)  =:  r. 

Let  xp(-)  be  the  time  optimal  path  from  xP  =  xp(0)  to  y  =  xp(T)  such  that  {xp(t)  \  t  £  [0,T]}  C 
R*.  Then 

(15)  t  =  wR*(xP(t)]Xp)  <  t*(xp(t)m,xE),  for  t  £  [0,T]. 

The  first  equality  follows  from  a  property  of  time-optimal  paths,  c.f.  Corollary  2.3  in  [Vla06].  For 
t  >  T,  set  xp(t)  =  y ,  i.e.  P  is  stationary  at  y.  Then,  from  Definition  (3.1), 

t  <  t*(y;x°E)  =  t*(xP(t);x°E),  for  t  G  [T,t). 

Thus,  xp(-)  is  a  r-admissible  pursuit  path  such  that  xp(T)  =  y  for  T  £  [0,r). 

2.  If  such  a  r-admissible  pursuit  path  existed,  then  WR*(y;xP )  <  t*(y;xE);  this  contradicts  the 
maximality  of  R* .  □ 

Remark  3.7.  Suppose  part  1  of  Lemma  3.6  did  not  assume  the  existence  of  a  time-optimal  path  from 
ip  to  y  in  R*.  In  the  spirit  of  Remark  2.2,  we  can  modify  Definition  3.3  and  the  proof  in  Lemma  3.6 
to  characterize  a  sequence  of  paths  in  R *  satisfying  (15)  in  the  limit.  Namely,  for  fixed  5  >  0  define 
R$  C  fifree  to  be  the  maximal  set  containing  xP  such  that 

(16)  wR*  (x;  x°P)  <  t*  (x\ x°E )  —  5,  for  all  x  £  . 

It  can  be  verified  that  U<5>o  =  ^*-  Then,  consider  xep(-),  an  e-time  suboptimal  path  in  R$.  It 

can  be  shown  that 

t  <  t*(xp6(t );  xE)  —  5/2,  for  t  £  [0,  T]. 

Thus,  the  limiting  path  as  e  — )>  0  followed  by  5  — »  0  satisfies  the  conditions  in  (15).  The  rest  of  the 
proof  follows  with  the  same  argument  as  in  the  earlier  proof. 

We  close  this  subsection  with  the  proof  of  Theorem  3.5. 

Proof  of  Theorem  3.5.  By  Lemma  3.6  and  Remark  3.7,  all  [0,  r)  time  portions  of  r-admissible  pur¬ 
suit  paths  lie  inside  R*.  Therefore,  P’s  static  value  is  bounded  above  by  supxeR*  t*(x;  x^ff),  i.e. 
vs(x°E,x°p)  <  supxeij.  t*(x;x°E). 

We  claim  that  vs(x(E,x(P)  >  supxeR*  t*(x;  x®E).  Consider  a  sequence  {^}^0  in  converging 
to  y  £  R*  so  that  :=  Pfypx^)  — >  supxeR*  t*(x\x®E)  as  i  -A  oo.  By  Lemma  3.6,  there  is  a  tp 

admissible  pursuit  path  for  each  i  —  0, 1,  2, - This  implies  that  vs(xE,xP)  >  Ti  for  all  i.  Taking 

i  -A  oo,  we  have  the  desired  inequality.  □ 

3.2.  Optimal  paths.  In  this  section,  we  discuss  a  characterization  of  optimal  paths  of  P  and  E  for 
the  static  game. 

From  the  proof  of  Theorem  3.5,  each  of  P’s  optimal  paths  will  reach  an  optimal  final  point  xp  £  R* 
such  that  there  exists  a  sequence  in  converging  to  xp,  and  t*(yp  x®E)  — >  supxeR *  t*(x;  x®E) 

as  i  -A  oo.  We  shall  denote  all  possible  optimal  final  points  as  “argsup^^*  £*(x;  a^)”. 

As  for  E’s  optimal  paths,  we  first  recall  the  optimal  control  value  function  uR  is  defined  in  (2). 
Since  E  has  a  strategy  to  become  occluded  from  P  at  time  vs(x(E,x(P)  (see  Theorem  3.5),  the  final 
state  xE  would  be  any  point  that  is  reachable  from  xE  within  time  vs(xE,xP)  and  is  occluded  from 
the  point  xp,  i.e. 

XE  e  Vxfp  n  iX  I  Ue(x)  <  Vs(x°E,X°p)}. 

The  set  on  the  right  hand  side  is  non-empty:  t*(xp;xE)  =  vs(xE,xP)  implies  that  there  exists  a 
<je  £  A  for  E  to  occlude  from  the  vantage  point  xp  at  time  vs(x(E,x(P).  The  optimal  path  for  E  is 
therefore  its  time-optimal  path  from  x ^  to  xE  in  Dfree. 
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3.3.  Construction  of  R*.  Recall  that  R*  is  the  maximal  set  such  that  the  function  wR*  satisfies 
the  inequality  (12).  This  characterization,  however,  is  not  constructive.  To  this  end,  consider  a 
sequence  of  sets  {R^}?L0  defined  as  follows: 

Ro  ■=  {x  e  ftfree  I  UP(x)  <  t*(x;X°E)}, 

(17) 

Ri+1  1=  { X  G  Hfree  |  ^ Ri(x'i  X p)  ^  t  [x]  X p)} 

where  up(x )  is  the  minimum  time  for  P  to  arrive  at  x  from  the  starting  location  £p,  defined  as 
per  (3). 

Lemma  3.8.  Ro  D  R*. 

Proof.  We  prove  by  a  contrapositive  argument.  Suppose  we  have  up(x;x°P)  >  t*(x;x°E),  i.e.  x  G 
Hfree\Ro-  Then,  no  path  for  P  can  reach  x  before  t*{x\x®E),  the  time  necessary  for  E  to  become 
occluded  from  x.  This  implies  that  no  point  on  the  [0,  r)  portion  of  any  r-admissible  pursuit  path 
would  coincide  with  x.  Thus,  by  Lemma  3.6,  x  G  Ilf ree\R*.  □ 

The  following  theorem  proves  that  Ri  converges  monotonically  to  a  set  that  is  contained  in  R* 
and  that  contains  R*: 

Theorem  3.9.  The  sequence  {Ri}pL0  satisfies 

1.  Rip i  C  Ri  for  all  i  =  0, 1,  2, ... ,  and 

2.  R*  c  a=i  Ri  Q  R*. 

Remark  3.10.  For  numerical  purposes,  when  the  set  R^  :=  fj ^=1Ri  is  approximated  by  discrete 
points  on  a  finite  grid,  R*  and  R*  are  indistinguishable.  Therefore,  when  applying  Theorem  3.5  in 
the  forthcoming  algorithm  (see  Algorithm  1),  we  are  justified  in  treating  R0 0  to  be  R* . 

Proof  of  Theorem  3.9.  1.  By  Lemma  3.8,  the  initial  set  Ro  is  a  superset  of  R*.  Then  we  note  that 

(18) 

Ri+ 1  =  {x  e  fifree  |  WRi(x]X°p)  <  t*  (x;  X°E)j  C  {x  G  fifree  |  WRi(x]X°p)  <  Oo}  C  Rh  1  =  0,  1,  2,  •  •  •  . 

The  last  inclusion  follows  from  Definition  3.2  that  wRi  is  infinite  in  Df ree\Ri.  Define  R0 0  :=  fj°^0  Ri- 

2.  To  see  that  R^  DR*,  we  show  by  induction  that  Ri  DR*  for  all  i  =  0, 1,  2 -  The  base 

case  i  =  0  was  proved  in  Lemma  3.8.  Assume  Ri  D  R*.  Then,  wpi  (x\x*p)  <  wR*(x;  xj,)  for  all 
x  G  R*.  From  Definition  3.3,  wR*(y;x(p)  <  t*(y,x®E)  for  all  y  R*  C  Rp  The  last  two  facts  imply 
wRi(y,x°P)  <  wR*(y-x°P ),  so  y  G  R;+i.  Therefore,  Ri+ 1  D  R*. 

Next,  we  show  Roo  C  R*.  Fix  a  5  >  0  small  and  define  the  sequence  of  sets  {Rf}-^0  as  follows: 


(19) 


Ro  :=  {x  G  Dfree  |  uP(x;x°p)  <  t*(x;xE)  -  5}, 

Rip  1  :=  {x  G  free  |  WRs(x]Xp)  <  t* (x;  xE)  -5},  i  =  0, 1,  2, - 


We  define  R^  :=  fj^o  Rf-  The  inclusions  Rf+1  C  Rf  can  be  shown  via  a  similar  argument  in  (18). 
We  claim  that 


(20) 


Roo  —  ^  ^Ifree  |  x p)  <  t  (x;Tp)}. 


Thus,  by  the  maximality  of  R*,  we  have  R^  C  R*.  Since  R^  C  (J5>0  R^  and  U<5>o  ^oo  ^  R*,  we 


conclude  that 


i?oo  c  (j  c  ij*, 

<5  >0 


as  desired.  To  show  (20),  first  note  that 


RSoo  =  fiu  G  ^free  I  wRsi(x’x°p)  <  t*(x;x0E)  -  (5), 


i= 0 


D  G  Hfree  |  ^R^i^XjXp^j  ^  t  (x,Tp)  $ } , 

C  {x  G  Hfree  |  WpS^^Xp)  <  t*(x;XE)}. 

Next,  to  show  R^  D  {x  G  Hfree  |  wRs^(x;  x°P)  <  £*(x;£p)},  take  any  y  G  Hfree  such  that 
u;p<5  (y\x®P)  <  £*(?/; £p);  clearly  (y\x®P)  is  finite,  therefore  y  G  R^.  This  proves  (20).  □ 
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3.4.  Algorithm  and  examples.  We  summarize  the  steps  described  in  the  previous  sections  in 
Algorithm  1.  Note  that  we  have  omitted  details  on  computing  optimal  control  value  functions  and 
time-optimal  paths  within  the  algorithm;  this  is  in  the  spirit  of  both  simplifying  the  presentation 
and  emphasizing  the  independence  of  the  algorithm  to  particular  computational  methods. 


Algorithm  1:  Algorithm  for  the  static  visibility-based  surveillance-evasion  game. 
Input  :  Domain  information  U  =  Dfree  U  D0bs- 

Initial  states  x°E ,  x°P  and  dynamics  /#,  fp. 

Output:  Optimal  occlusion  time  T  =  vs(x(E^x(P). 

Optimal  paths  x*E,x*P :  [0 ,  T]  -A  Dfree. 

Step  1 

Compute  functions  ue{x),up(x)R*(x;x(e)  on  x  G  Ufree; 

Step  2  (construct  R*) 

Ro  ■=  {x  |  Up(x)  <  t*(x;x%)} 
foreach  i  =  0, 1,  2, . . .  until  Ri  stop  changing  do 
Compute  wpi  on  Rp 
Ri+1  ■=  {x  I  wRi(x)  <  t*(x;x°E)}; 

R*  :=  Ri+ 1; 

Step  3  (extract  static  value  and  optimal  paths) 

Return  T  :=  supxeR*  t*(x;x°E); 

Find  xp  G  arg  sup^^*  t*(x;x°E ); 

Return  the  time-optimal  path  from  x°P  to  xp  in  R*; 

Choose  xE  G  Vxf  D  {x  \  up{x)  <  T}; 

Return  xE(-),  the  time-optimal  path  from  xE  to  xp  in  Ufree; 


Remark  3.11.  Since  wRi  is  simply  a  minimum  arrival  time  value  function,  it  can  be  computed 
efficiently  by  solving  a  Hamilton- Jacobi  equation  numerically,  see  Remark  2.1. 

Remark  3.12.  If  state-of-the-art  numerical  Hamilton- Jacobi  solvers  are  employed  in  Step  1  to  solve 
ue(-),  up{-)  and  t*(-m,xE),  the  bulk  of  the  computation  would  be  spend  in  the  latter  function;  it 
involves  the  computation  of  Vx  (see  Remark  3.11)  and  the  minimum  arrival  time  function  to  Vx 
from  y  for  each  x,y  G  fifree.  If  £2free  C  M2  is  discretized  on  a  N  x  N  grid,  this  would  involve 
at  least  0(N 4)  operations:  0(N 2)  for  computing  the  level  set  function  representation  for  Vx  (see 
Appendix  A)  and  at  least  0(N 2)  for  solving  the  minimum  arrival  time  to  Vx ,  c.f.  [TCOZ03].  While 
the  worst-case  computational  complexity  is  comparable  to  solving  the  feedback  strategy  game  via 
a  Hamilton- Jacobi-Isaacs  equation,  we  mention  a  few  advantages  of  solving  the  static  game  using 
Algorithm  1: 

1.  The  most  costly  part  of  the  algorithm  is  easily  parallelizable:  t*(x\x®E)  can  be  evaluated 
independently  for  each  discrete  node  x  in  Dfree. 

2.  Small  memory  footprint:  the  memory  complexity  is  only  0(N 2)  since  for  each  node  x,  the 
level-set  representations  of  Vx  and  the  corresponding  minimum  arrival  time  function  need 
not  be  stored  once  t*(xm,xE)  is  evaluated. 

3.  Exact  solutions:  for  domains  with  simple  geometry  (e.g.  circles,  polygonal  obstacle  bound¬ 
aries)  and  constant  speeds,  it  may  be  possible  to  derive  an  analytical  formula  for  t*(x;x^) 
for  x  G  f^free  5  see  Examples  1  and  2  below. 

Next,  we  present  three  examples  demonstrating  our  algorithm.  In  all  test  cases,  D  =  [ — 1 , 1] 2 . 
Computations  were  performed  on  a  desktop  running  on  a  3.33  GHz  dual-core  CPU  with  4  GB  of 
RAM  memory.  For  examples  1  and  2,  computations  took  less  than  a  second,  since  an  exact  formula 
for  £*(•;  xE)  was  employed.  For  example  3,  the  complete  computation  took  approximately  5  minutes. 
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3.4.1.  Example  1:  circular  obstacle.  The  first  example  concerns  a  single  circular  obstacle,  with  E 
and  P  placed  as  shown  in  Figure  2  (a). 

Due  to  the  simple  geometry,  it  is  possible  to  analytically  derive  a  formula  for  t*  {x\x®E)  for  x  G  Dfree. 
All  other  functions,  ue,  up  and  w r.  {i  =  0, 1, . . . )  were  approximated  on  a  uniform  cartesian  grid  of 
size  4002.  The  function  t*(-m,xE)  is  shown  in  Figure  3  (a).  In  Figure  3  (b),  the  computed  R*  using 
the  algorithm  17  is  plotted,  as  well  as  the  optimal  paths  of  E  and  P. 


Figure  2.  Obstacle  (shaded)  and  initial  states  for  (a)  Example  1  and  (b)  Example  2. 


Figure  3.  Example  1.  (a)  Surface  plot  of  where  x®E  =  (0.8, 0.6).  (b)  A 

plot  illustrating  R*  (blue  curve),  t*(-;x^)  (background),  xE  (blue  circle),  xE  (blue 
square)  and  the  optimal  paths  of  P  (blue  dotted  curve)  and  E  (red  dotted  curve) , 
where  the  isotropic  speeds  are  /p  =  1,/p  =  1.2.  The  purple  curve  represents  the 
visibility  region  from  the  vantage  point  xE. 


3.4.2.  Example  2:  two  corridors.  Next  we  consider  a  case  where  xE ,  x°P  and  £2free  are  shown  in 
Figure  2  (b).  Again,  due  to  the  simplicity  of  the  geometry  of  the  domain,  we  derived  an  analytical 
formula  for  t*(x,x®E)  for  x  G  Qfree- 

For  the  sake  of  analyzing  the  solution,  we  call  the  set  S  to  be  the  square  (x,  y)  G  [—0.25,  0]  x  [0,  0.25] 
(the  north-east,  south-west  and  south-east  vertices  of  S  coincide  with  the  vertices  near  the  origin 
of  the  three  obstacles).  Also,  we  call  C\  the  set  of  points  (x,y)  G  [0, 1]  x  [0,0.25]  in  the  “horizontal 
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corridor”  between  the  upper  right  and  bottom  right  rectangular  obstacles.  Likewise,  we  call  C2  the 
set  of  points  (x,y)  £  [—0.25,0]  x  [—1,0]  in  the  “vertical  corridor”  between  the  lower  left  and  bottom 
right  rectangular  obstacles.  Note  that  the  set  of  states  where  P  has  the  greatest  visibility  of  domain 
is  5;  this  is  reflected  by  the  peak  in  t*(x;xE )  for  x  £  S,  shown  in  Figure  4  (a)  and  (b).  This 
prompts  a  conservative  P  to  move  towards  S  via  a  time-optimal  path,  and  remain  stationary  until 
the  game  terminates.  Indeed,  if  P  “commits”  to  entering  C\  then  E  would  counter  by  moving  into 
C2,  thereby  ending  the  game  prematurely;  a  similar  argument  follows  if  P  commits  to  entering  C2. 
Consequently,  E’s  final  state  xE  to  become  occluded  from  a  P  in  S'  would  either  be  the  south-west 
or  north-east  vertex  of  the  bottom  right  obstacle,  whichever  is  closer  to  xE ;  in  the  present  example, 
xE  is  the  former. 


Figure  4.  Example  3.  (a)  Surface  plot  of  £*(•;  x®E),  where  x®E  =  (—0.6, 0.15).  (b)  A 
plot  illustrating  R*  (blue  curve),  £*(-;^p)  (background),  xE  (blue  circle),  xE  (blue 
square)  and  optimal  paths  of  P  (blue  dotted  curve)  and  E  (red  dotted  curve),  where 
the  isotropic  speeds  are  /p  =  1,  /p  =  1.2.  The  purple  curves  represent  the  visibility 
region  from  the  vantage  point  xE. 


3.4.3.  Example  3:  inhomogeneous  speeds.  In  Figure  5,  we  illustrate  how  inhomogenous  speed  func¬ 
tions  /p  and  fp  can  be  handled  using  the  proposed  algorithm.  The  following  isotropic  velocity 
functions  were  used:  for  all  dp  £  A  and  dp  £  A, 

/p(x,  dp)  =  fp(x,  dp)  =  f(x)  =  1  —  0.8  sin(47rxi)  sin(47TX2)  where  x  =  (aq,  #2). 

Since  an  analytical  formula  for  t*(-]x(E)  is  too  complicated  to  derive,  it  was  constructed  numerically 
via  Definition  3.1.  All  computations  were  performed  on  a  2002  domain.  Note  that,  by  observing 
Figure  5  (b),  both  players  travel  where  their  speeds  are  the  greatest  (yellow  regions). 

4.  Applications  and  extensions 

In  this  section,  we  present  several  immediate  applications  and  extensions  of  the  static  game 
algorithm.  In  section  4.1,  we  show  that  for  a  generalized  version  of  the  visibility  pursuit-evasion 
game  with  multiple  evaders,  the  computation  scales  linearly  with  the  number  of  players.  In  section 
4.2,  we  present  a  feedback  control  strategy  by  invoking  the  static  game  algorithm  iteratively  over 
short  time  intervals. 

4.1.  Multiple  evaders.  A  natural  extension  to  the  present  algorithm  is  the  visibility  based  single¬ 
pursuer,  multiple-evader  (static)  game.  Suppose  there  are  k  evaders,  Ei,  E2, . . . ,  E&,  where  E^’s 
motion  satisfies  the  initial  value  problem 

xEi(t)  =  fEi{xEi{t),aEi{t))aEi{t),  xEi(0)=xE.  <E  fifree  for  each  i  =  1, 2, . . . ,  k. 
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Figure  5.  Example  2.  (a)  A  plot  illustrating  R*  (blue  curve),  t*(-;x°E)  (back¬ 
ground),  xE  (blue  circle),  xE  (blue  square)  and  optimal  paths  of  P  (blue  dotted 
curve)  and  E  (red  dotted  curve),  with  an  intensity  plot  of  t*(-;T^)  in  the  back¬ 
ground,  with  the  speed  function  f(x).  (b)  Same  plot  as  (a),  except  with  the  speed 
function  f(x)  in  the  background. 

For  each  i,  the  control  crEi  G  A  is  chosen  as  with  the  two  player  game.  We  assume,  for  each  i,  the 
velocity  fEi  satisfies  the  conditions  (A1)-(A3). 

Given  Xp,  {x^.}^^  and  crp  G  A,  {<JEi  G  A}^=1,  consider  the  payoff  function 


(21) 


Jk{x°El,  ■  ■  ■ ,  x°Ek,  x°p,  aEl,  ■  ■  ■ ,  aEk,  crp)  =  min  J(x°E  ,x°P,aEi,aP), 

1=1  ,...,/c 


where  J  is  the  payoff  function  for  the  two-player  game  (7).  Here,  the  game  terminates  the  first 
instance  that  at  least  one  evader  becomes  occluded  from  the  pursuer.  The  key  property  is  that  each 
E i  plays  independently  against  P,  thus  solving  this  game  amounts  to  k  instances  (for  each  E$)  of 
the  two-player  game.  As  with  the  two-player  game,  all  players  choose  their  complete  controls  at  the 
start  of  the  game. 

Let  t*(-;Tp;.)  be  the  function  defined  as  per  (11)  with  initial  state  x®E..  Then,  analogous  to  (13), 
the  r-admissible  pursuit  path  xp(-)  for  P  among  the  E^’s  must  satisfy  t  <  t*(xp(t);  xE.)  for  all 
f  G  [0,r)  and  i  =  1  or  equivalently, 

t<  min  t*(xp(t)m,  x°E.)  for  allt  G  [0,  r). 

i=l,...,k  z 

The  rest  of  the  algorithm  follows  the  same  procedure  as  with  the  two-player  game,  where  t*(x;x°E) 
is  replaced  by  mnq=i a?^.);  the  definitions  of  i?*,  wp*,  etc.  can  be  defined  accordingly.  For 
example,  the  static  value  function  for  k  evaders  becomes 

vk(xEl,---,xEk,x0p)  =  sup  min  t*(x;x°E), 

xeR*  i=h--,k 

and  the  final  state  of  P  is  xE  =  argsupa,ei?^  min^p...^  t*(x;  x®E.).  Furthermore,  there  exists  i  G 
{1, . . . ,  k}  and  (a  final  state)  xE  G  £2free  such  that  xE  is  reachable  by  E^  (via  a  time-optimal  path) 
in  time  Vk (xEl ,  -  •  • ,  %Ek ,  Xp)  and  occluded  from  xE. 

Remark  4.1.  Since  each  t*(--,xE  )  can  be  computed  independently  on  the  underlying  domain,  the 
computational  complexity  for  multiple  evaders  scales  linearly  with  the  number  of  players. 


4.2.  Applications  to  a  feedback  control  game  and  target  tracking.  Let  us  return  to  the 
two-player  game  setting,  E  versus  P.  In  realistic  settings,  controls  of  one  or  more  players  are  chosen 
using  feedback  controls,  rather  than  open  loop  controls.  To  model  a  feedback  control  strategy,  one 
can  advance  the  players  over  short  time  intervals  according  to  the  optimal  controls  computed  by 
Algorithm  1,  starting  at  the  current  states. 
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Figure  6.  The  solution  to  a  visibility  single-pursuer,  three-evader  static  game. 


Suppose  the  player  states  are  sampled  at  discrete  times  A  =  (0  =  to,  ti,  t2, . . . ,  tn),  where  U  <  ti+i 
for  all  i  =  0, 1,2,  The  aforementioned  feedback  control  strategy  can  be  stated  as  follows. 

Suppose  E  and  P  initially  start  at  states  x®E  and  ip,  respectively.  We  write  the  player  states  in  each 
time  interval  as 


{xE(t)  =  X$(t  -  ti), 
pp(0  =  x^(t  -  ti), 


t  £  [U,  U+ 1]‘ 


In  the  time  interval  [t^,t^+i],  the  states  satisfy  the  dynamics 
(22)  Fe(0  =/i?(IE)(t).$(<))$(t). 

;  1 4\t)  = 


t  e  [o,  u+i  -  ti]. 


with  initial  conditions  defined  recursively 

(23)  (4)(0),4)(0))=(5^  y 

[  \XE1  xp) 


ti-l),  Xp 


if  i  >  1, 
if  i  =  0. 


For  fixed  states  xe,xp  E  f2free>  the  feedback  controls  <£>(£)  =  xp\  •)  G  ^4  and  T(t)  = 

T(x^,xp;-)  G  ^4  are  the  static  game  controls  computed  from  the  joint  state  configuration  at  the 
discrete  time  sets: 


(24) 


^(xE,xP;  •)  G  argsup  inf  J(xE,xP,aE,crp ) 

aPeAaEeA 

$(xE,xp-r)  €  arg  inf  J{xE,xP,aE,^{xE,xP-,  •)))• 
cr  EeA 


In  Figure  7,  We  show  a  comparison  of  the  static  and  feedback  control  games  on  a  simple  example. 
The  players  both  have  unit  speeds  in  a  square  domain  [—1,1] 2  with  a  single  circular  obstacle. 
Here,  we  sampled  and  recomputed  the  controls  at  equal  intervals  A  =  (0,  At,  2At,  3At, . . . )  where 
At  =  0.025;  the  iterative  procedure  was  performed  until  E  became  occluded  from  P.  Notice  how 
the  feedback  strategy  game  prolonged  the  game  duration  compared  to  the  static  game.  This  makes 
intuitive  sense:  since  the  static  value  (8)  corresponds  to  the  most  conservative  strategy  for  P,  the 
iterative  procedure  relaxes  this  conservatism  and  thus  yields  a  more  favorable  outcome  for  P. 


Remark  4.2.  In  general,  the  set  of  optimal  controls  above  are  not  unique.  Take  for  instance  example 
2  (the  two-corridor  example):  all  time-optimal  controls  for  P  that  reach  a  (final)  point  xE  G  S 
are  optimal  controls  for  the  static  game.  While  the  static  values  are  all  equal  for  all  such  optimal 
controls,  some  choices  may  be  more  advantageous  since  it  will  directly  affect  the  feedback  control 
in  subsequent  iterations.  As  a  sensible  strategy  to  uniquely  choose  among  the  optimal  state  game 
controls,  at  each  step  i,  we  implement  P  to  choose  the  control  corresponding  to  the  final  point  Xp 
such  that  \\xp  —  xe(U)\\2  is  minimized,  i.e.  choose  the  control  to  move  to  the  point  closest  to  the 
current  state  of  E. 
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Figure  7.  A  comparison  between  the  static  and  feedback  control  games.  From 
left  to  right:  the  initial  states  and  obstacles,  the  optimal  static  game  paths,  and 
the  optimal  feedback  control  game  paths.  In  the  feedback  control  game,  we  have 
plotted  the  sequence  of  positions  of  the  players  as  well  as  the  R*  at  each  iteration. 

As  another  illustration,  we  show  snapshots  of  the  optimal  paths  computed  using  the  iterative 
static  information  pattern  described  above,  in  Figure  9.  The  state  constraints,  dynamics  and  initial 
conditions  are  the  same  as  in  Figure  4  (the  two-corridor  example). 

Notice  that  between  i  =  16  and  i  =  25,  P  is  forced  to  stay  within  S',  owing  to  P’s  conservative 
nature  in  the  static  game;  indeed,  if  P  commits  down  the  vertical  corridor  C2  at  i  =  16,  then  E 
would  counter  by  reversing  course  and  occlude  behind  the  corner  at  (0,0).  It  is  only  after  E  has 
passed  the  midway  point  of  C2  at  i  =  25,  that  P  deems  his  commitment  down  C2  to  be  worthwhile. 

4.2.1.  Target  tracking.  In  a  realistic  setting,  one  of  P  or  E  may  move  sub-optimally.  In  the  case 
where  E  is  sub-optimal,  the  problem  can  be  seen  as  a  target-tracking  problem,  where  E  is  the 
(moving)  target  and  P  is  the  observer.  This  can  be  modeled  by  modifying  the  formulation  (22)- (24) 
so  that  =  4>(t)  G  A  is  some  arbitrary  prescribed  control  for  E  (the  oblivious  target).  That  is,  at 
each  discrete  time  U,  only  P  chooses  its  optimal  static  game  control  based  on  the  current  states  of 
the  system.  An  numerical  result  of  this  approach  is  shown  in  Figure  8. 

5.  Conclusions  and  future  work 

In  this  article,  we  have  presented  an  efficient  algorithm  to  compute  the  value  and  optimal  controls 
for  a  class  of  pursuit-evasion  games,  called  the  visibility-based  surveillance-evasion  game.  The  novelty 
of  the  algorithm  is  the  decomposition  of  a  game  into  optimal  control  problems,  which  can  be  solved  on 
a  lower  dimensional  domain.  This  decomposition  is  made  possible  by  the  decoupled  player  dynamics 
and  the  so  called  static  information  pattern.  Generalizations  to  inhomogenous  speeds  and  multiple 
evaders  were  presented,  as  well  as  applications  to  target  tracking  and  feedback  control  games. 

The  present  technique  of  solving  the  static  game  using  optimal  control  was  recently  applied  to 
another  problem  in  [THDT12].  There,  the  upper  static  value  for  the  game  known  as  cap ture-the- flag 
[HDZT11]  was  considered.  We  are  currently  investigating  a  more  general  class  of  pursuit-evasion 
type  games  that  can  be  solved  via  a  similar  decomposition  technique. 

We  are  also  investigating  a  more  efficient  method  from  computing  £*(•;  x°E),  which  is  the  bottleneck 
to  our  algorithm  in  terms  of  computation  time.  Preliminary  tests  have  shown  that  for  cases  with 
relatively  small  R*,  the  computation  time  can  be  significantly  reduced.  For  instance,  the  test  case 
shown  in  Figure  8  can  be  produced  at,  on  average,  approximately  3  seconds  per  iteration  (i.e.  for 
each  ti)  using  the  more  efficient  approach.  Another  useful  result,  in  the  spirit  of  Example  2,  would 
be  an  algorithm  that  computes  the  exact  formula  for  t*(-;x^)  in  a  polygonal  domain  for  an  E  with 
a  constant,  isotropic  speed. 
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Figure  8.  A  target  tracking  example.  Both  players  start  at  the  bottom  right 
entrance  to  the  “cave”.  The  target’s  path  (in  red)  is  predefined,  heading  towards  to 
the  cave  exit  at  the  top  right  corner,  the  tracker’s  path  (in  blue)  is  computed  using 
a  sequence  of  controls  computed  by  Algorithm  1  from  the  current  states  at  fixed 
time  intervals.  The  black  line  segment  connects  the  target  and  tracker  states  at  the 
same  discrete  time  instances  U. 
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Appendix  A.  The  level  set  visibility  and  shadow  functions 

In  this  appendix,  we  describe  a  fast  sweeping  type  method  for  constructing  an  implicit  representa¬ 
tion  for  VXQ  for  an  arbitrary  vantage  point  xq  G  flfree-  This  implicit  representation  uses  a  continuous 
level  set  function,  which  we  write  as  £(•;  xq)  :  Q  -A  M,  holding  the  property 

(25)  £0c^o)  <  0  x  G  VXo. 

We  shall  call  £(-;xo)  the  shadow  function  from  the  vantage  point  xo  G  Cl  free-  The  minimum  arrival 
time  function  to  Vx  =  {x  G  £4free  |  £(x;  xo)  <  0}?  denoted  by  £*,  can  then  be  conveniently  computed 
as  per  the  definition  (11),  by  solving  an  eikonal  equation  with  the  boundary  conditions  defined  by  £. 
We  begin  by  assuming  that  the  obstacles  are  represented  as  a  level  set  function  ^  0  -A  M, 

(26)  'ip(x)  <0<^xG  fl0bs- 

We  shall  refer  to  such  a  function  as  an  occluder  function.  Let  T  =  {x  \  !p(x)  =  0}  be  the  boundary 
of  the  obstacles.  For  what  follows,  we  assume  that  T  is  smooth,  and  \/ip(x)  /  0  on  T.  Consequently, 
V^(x)/||V^(x)||  =  n  defines  the  unit  outer  normal  of  fl0bs  on  T. 
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Figure  9.  Snapshots  of  the  iterative  static  game  algorithm. 

Next,  we  adopt  the  framework  of  [TCO+04]  by  defining  the  visibility  function  associated  with 
the  occluder  function  and  the  vantage  point  ro  G  9: 

(27)  <p(x-,x0)=  min  ip(y), 

yeC(x,x  0) 

where,  we  take  C(x,xo)  as  the  line  segment2  between  x  and  x$.  Note  that,  by  definition,  ip(x,y)  = 
(p(y,x).  Using  this  function,  we  define  the  visibility  of  x  from  the  vantage  point  x^: 

(28)  cp(x]  xo)  >  0  x  is  visible  from  x$. 

A  graphical  illustration  of  the  zero  sub-level  set  of  <^(-,  xo)  is  shown  in  Figure  11  (b). 

Remark  A.l.  We  point  out  that,  while  it  may  appear  that  f(x\x o)  =  max{(/?(x;  xq),  —  ^(x)}  is  a 
candidate  definition  for  the  shadow  function,  it  does  not  satisfy  (25).  Observe  that  f(x\x o)  =  0  for 
all  x  G  T  and  in  general,  V  (£_  VXo  U  dVXo ;  consider  the  points  x  G  T  that  are  “visible”  from  the 
vantage  point,  i.e.  (xo  —  x)  •  V^(t)  >  0.  See  also  Case  V  below. 

We  now  describe  a  method  for  constructing  the  shadow  function  £(-;xo)  by  performing  a  few 
Boolean  operations  on  sets  defined  by  some  auxiliary  level  set  functions.  Fix  a  vantage  point  tq  G 


2In  general,  £(x,  xo)  is  the  curve  representing  the  path  of  a  ray  of  light  connecting  x  and  xq  influenced  by  refraction 
and/or  reflection. 
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Dfree.  Define  the  auxiliary  function  g(-;x o) :  D  -A  M  as 

#0;  t0)  =  max{^(r),  (t0  -  x)  •  V^>(t)}. 

Thus  g  describes  the  portions  of  <9D0bs  that  cannot  be  visible  from  xq.  Let  g(-\x o) :  D  M  be  the 
visibility  function  associated  with  the  occluder  function  2p(x)  =  #(t;to).  Then,  a  candidate  shadow 
function  is  given  by 

(29)  £0*5  ^o)  =  max{-'tl>(x),g(x]x0)}. 

We  illustrate  the  auxiliary  function  g  and  the  shadow  function  £  in  Figure  11  (c)  and  (d).  Note 
that,  this  construction  of  £  involves  solving  the  visibility  function  twice  and  a  pointwise  maximum 
operation  twice.  Thus,  the  calculation  is  O(N)  for  on  a  mesh  with  N  grid  points. 

Now,  we  show  that  (25)  is  satisfied  by  considering  all  possible  cases  for  x  G  Q.  For  the  purpose 
of  justifying  the  construction  of  £  above,  we  define  a  line  7:  [0,  s\  -A  D,  parametrized  by  arc  length, 
where 

C(x, x0)  =  (7(5)  I  s  e  [0,s]},  7(0)  =  xo,j(s)  =  X. 

If  x  G  VXo ,  we  also  define  81, 82  G  [0,  s]  such  that  7(81)  is  the  first  entry  point  into  D0bs  and  7(82) 
is  the  first  exit  point  from  D0bs- 

81  =  argmin  {7(5)  G  Dobs},  s2  =  arS  min  {t(5)  £  r}  >  si- 

se[0,s]  sG[si,s| 

See  Figure  10  depicting  7  on  a  simple  example.  Note  that,  if  s  1  =  s2,  then  £(t,to)  is  tangent  to 
^obs  at  7(^1)* 


T0  =  7(0) 


Figure  10.  An  example  illustrating  £(t,t 0)  =  {7 (s)  \  s  G  [0,  s ]}  where  7(^0)  =  To 
is  the  vantage  point. 

Thus,  the  consistency  of  (29)  with  (25)  can  be  justified  by  considering  the  following  five  cases  for 
tgH: 

Case  I:  t  G  int(D0bs)-  Since  ^(t)  <  0  we  have  £(t;to)  >  0,  as  desired. 

Case  II:  t  0  VXQ .  In  this  case,  we  have  C{ t,  to)  C  fifree,  so  g(y ;  To)  >  ip(y)  >  0  for  all  y  G  £( t,  to). 

Thus,  ^(t;to)  >  0.  By  (29),  we  conclude  that  £(t;to)  >  0,  as  desired. 

Case  III:  t  G  T>Xo.  Note  that  at  y  =  y(s2)  G  T,  we  have  (to  —  y)  •  \/^{y)  <  0  and  ^(y)  =  0. 
Therefore,  g(y;  To)  =  0  and 

9( x;  x0)  =  min  ^(7(5);  t0)  <  g(y\ t0)  =  0. 

se[o,s] 

Furthermore,  since  —^(y)  <  0,  we  have  that  £(t;to)  <  0  as  desired. 

Case  IV:  t  G  TC\dVXQ.  Here,  we  necessarily  have  x  =  7(82).  Then,  by  the  same  argument  as  Case 
III,  we  have  #(t;to)  =  #(t;to)  =  ^(t)  =  0.  Thus,  £(t;to)  =  0  as  desired. 

Case  V:  t  G  T\dVXo.  In  this  case,  x  =  7(81)  and  81  <  82.  Then,  (to  —  t)  •  W/ijj(x)  >  0,  so 
#(t;to)  >  0.  Furthermore,  note  that  <7(7(8);  To)  >  ^(7 (s))  >  0  for  s  G  [0,  81).  Therefore, 
^(t;tq)  >  0,  and  we  conclude  that  £(t;tq)  >  0. 
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(a)  Obstacle  function 


(c)  Auxiliary  function 


i 


(d)  Shadow  function 


Figure  11.  Illustration  of  the  construction  of  the  shadow  function  in  M2.  The 
shaded  area  is  where  each  function  is  negative;  the  small  square  on  the  top  left  of 
each  plot  is  the  vantage  point  x$.  The  boundary  of  the  obstacles  T  are  shown  as 
black  contours,  (a)  The  obstacle  function  (b)  visibility  function  (/?(•,  To),  (c)  the 
auxiliary  function  g(-,x o),  and  (d)  the  shadow  function  £(-,#0)- 
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